Vakati and Fernandez-Baca Algorithms for Molecular Biology 2014, 9:1 3 
http://www.alnnob.Org/content/9/l/13 



AMB 



ALGORITHMS FOR 
MOLECULAR BIOLOGY 



RESEARCH 



Open Access 



Characterizing compatibility and agreement of 
unrooted trees via cuts in graphs 



Sudheer Vakati and David Fernandez-Baca 



Abstract 

Background: Deciding wlietlier tliere is a single tree — a supertree — tliat sunnnnarizes tlie evolutionary information 
in a collection of unrooted trees is a fundamental problem in phylogenetics. We consider two versions of this 
question: agreement and compatibility. In the first, the supertree is required to reflect precisely the relationships 
among the species exhibited by the input trees. In the second, the supertree can be more refined than the input trees. 
Testing for compatibility is an NP-complete problem; however, the problem is solvable in polynomial time when the 
number of input trees is fixed. Testing for agreement is also NP-complete, but it is not known whether it is fixed- 
parameter tractable. Compatibility can be characterized in terms of the existence of a specific kind of triangulation in a 
structure known as the display graph. Alternatively, it can be characterized as a chordal graph sandwich problem in a 
structure known as the edge label intersection graph. No characterization of agreement was known. 

Results: We present a simple and natural characterization of compatibility in terms of minimal cuts in the display 
graph, which is closely related to compatibility of splits. We then derive a characterization for agreement. 

Conclusions: Explicit characterizations of tree compatibility and agreement are essential to finding practical 
algorithms for these problems. The simplicity of the characterizations presented here could help to achieve this goal. 

Keywords: Phylogenies, Supertrees, Compatibility, Agreement, Cuts in graphs, Chordal graphs 



Background 

A phylogenetic tree T is an unrooted tree whose leaves 
are bijectively mapped to a label set C{T), Labels rep- 
resent species and T represents the evolutionary history 
of these species. Let 7^ be a collection of phylogenetic 
trees. We call V a profile, refer to the trees in V as input 
trees, and denote the combined label set of the input trees, 
^(^)' ^0^)' ^ supertree of 7^ is a phylogenetic 
tree whose label set is C(P), The goal of constructing a 
supertree for a profile is to synthesize the information in 
the input trees in a larger, more comprehensive, phylogeny 
[1]. Ideally, a supertree should faithfully reflect the rela- 
tionships among the species implied by the input trees. 
In reality, this is rarely achievable, because of conflicts 
among the input trees due to errors in constructing them 
or to biological processes such as lateral gene transfer and 
gene duplication. 

We consider two classic versions of the supertree prob- 
lem, based on the closely related notions of compatibility 

"Correspondence: fernande@iastate.edu 

Department of Computer Science, Iowa State University, Ames, Iowa, USA 



and agreement. Let S and T be two phylogenetic trees 
where C{T) c C{S) —for our purposes, T would be an 
input tree and S a supertree. Let S' be the tree obtained by 
suppressing any degree-two vertices in the minimal sub- 
tree of S connecting the labels in C{T), We say that S 
displays T, or that T and S are compatible, if T can be 
derived from S' by contracting edges. We say that tree T 
is an induced subtree of S, or that T and S agree, if S' is 
isomorphic to T, 

Let V he 3. profile. The tree compatibility problem asks 
if there exists a supertree for V that displays all the trees 
in V, If such a supertree S exists, we say that V is compat- 
ible and 5' is a compatible supertree for V, The agreement 
supertree problem asks if there exists a supertree for V that 
agrees with all the trees in V, If such a supertree S exists, 
we say that S is an agreement supertree (AST) for V, 

Compatibility and agreement embody different philoso- 
phies about conflict. An agreement supertree must reflect 
precisely the evolutionary relationships exhibited by the 
input trees. In contrast, a compatible supertree is allowed 
to exhibit more fine-grained relationships among certain 
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labels than those exhibited by an input tree. From a biolog- 
ical viewpoint, the differences between compatibility and 
agreement reflect different ways to treat polytomies — i.e., 
nodes of degree greater than three. Compatibility treats 
polytomies as soft facts: if an input tree node has degree 
four or more, it is not because there were multiple simulta- 
neous speciation events, but because there is not enough 
information to resolve the sequence of speciation. Thus, 
if another input tree provides more refined information 
about speciation order, we can use it, provided the infor- 
mation is not contradicted by the remaining input trees. 
Agreement, in contrast, treats polytomies as hard facts. 
Note that compatibility and agreement are equivalent 
when the input trees are binary. 

If all the input trees share a common label (which can 
be viewed as a root node), both tree compatibility and 
agreement are solvable in polynomial time [2,3]. In gen- 
eral, however, the two problems are NP-complete, and 
remain so even when the trees are quartets; i.e., binary 
trees with exactly four leaves [4]. Nevertheless, Bryant and 
Lagergren showed that the tree compatibility problem is 
fixed-parameter tractable when parametrized by number 
of trees [5] . It is unknown whether or not the agreement 
supertree problem has the same property. 

To prove the fixed-parameter tractability of tree com- 
patibility, Bryant and Lagergren first showed that a nec- 
essary (but not sufficient) condition for a profile to be 
compatible is that the tree-width of a certain graph — the 
display graph of the profile (see Section 'Display graphs 
and edge label intersection graphs') — be bounded by the 
number of trees. They then showed how to express com- 
patibility as a bounded-size monadic second-order for- 
mula on the display graph. By Courcelles Theorem [6,7], 
these two facts imply that compatibility can be decided 
in time linear in the size of the display graph. Unfortu- 
nately, Bryant and Lagergrens argument amounts essen- 
tially to only an existential proof, as it is not clear how 
to obtain an explicit algorithm for unrooted compatibility 
from it. 

A necessary step towards finding a practical algorithm 
for compatibility — and indeed for agreement — is to 
develop an explicit characterization of the problem. In 
earlier work [8], we made some progress in this direction, 
characterizing tree compatibility in terms of the existence 
of a legal triangulation of the display graph of the profile. 
Gysel et al. [9] provided an alternative characterization, 
based on a structure they call the edge label intersection 
graph (ELIG) (see Section 'Display graphs and edge label 
intersection graphs'). Their formulation is in some ways 
simpler than that of [8], allowing Gysel et al. to express 
tree compatibility as a chordal sandwich problem. Neither 
[8] nor [9] deal with agreement. 

Here, we show that the connection between separa- 
tors in the ELIG and cuts in the display graph (explored 



in Section 'Display graphs and edge label intersection 
graphs') leads to a new, and natural, characterization of 
compatibility in terms of minimal cuts in the display graph 
(Section 'Characterizing compatibility via cuts'). We then 
show how such cuts are closely related to the splits of 
the compatible supertree (Section 'Splits and cuts'). Next, 
we give a characterization of the agreement in terms of 
minimal cuts of the display graph (Section 'Characterizing 
agreement via cuts'). To our knowledge, there was no pre- 
vious characterization of the agreement supertree prob- 
lem for unrooted trees. Lastly, we examine the connection 
between the triangulation-based and the cut-based per- 
spectives on compatibility (Section 'Relationship to legal 
triangulations'). 

Prelimmaries 

Splits, compatibility, and agreement 

A split of a label set L is a bipartition of L consisting of 
non-empty sets. We denote a split {X, Y} hy X\Y, A split 
is non-trivial if neither of its sets is a singleton; otherwise, 
it is trivial. Let T be a phylogenetic tree. Let e be an edge 
of r. Deletion of e disconnects T into two subtrees T\ 
and If Ii and L2 denote the set of all labels in Ti and 
72, respectively, then is a split of C{T), We denote 
by Ge{T) the split corresponding to edge e of T; if e is a 
leaf edge, then ae{T) is a trivial split. Let E(r) denote the 
set of all splits corresponding to internal edges of T and 
^triviT) denote the set of all (trivial) splits corresponding 
to leaf edges of T, 

A tree T displays a split X|y if there exists an internal 
edge e oiT where Ge{T) = X\Y, K set of splits of a label 
set L is compatible if there exists a tree that displays all the 
splits in the set. It is well-known that two splits A 1 IA2 and 
B\ \B2 are compatible if and only if at least one of Ai H ^1, 
Al n B2, A2 n Bi and A2 0 B2 is empty [10]. Note that a 
trivial split of L is compatible with every split of L. 

Theorem 1 (Splits -Equivalence Theorem [10,11]). Let 

11 be a collection of splits of a label set X that includes all 
trivial splits. Then, E = Ti{T)\JT^triviT) for some phyloge- 
netic tree Twith label setX if and only if the splits in E are 
pairwise compatible. Tree T is unique up to isomorphism. 

Let 5 be a phylogenetic tree and let 1^ be a subset of 
C{S), Then, S\y denotes the tree obtained by suppress- 
ing any degree-two vertices in the minimal subtree of S 
connecting the labels in Y, Now, let T be a phylogenetic 
tree such that C{T) c C{S), Then, S displays T if and 
only if 11(7) c ll{S\c{T))y T and S agree if and only if 
E(r) = E(5|^(r)). 

Cliques, separators, cuts, and triangulations 

Let G be a graph. We represent the vertices and edges of G 
by V(G) and E(G) respectively. A clique of G is a complete 
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subgraph of G. A clique // of G is maximal if there is no 
other clique H' of G where V{H) C V{H'). For any U c 
V{G), G— U is the graph derived by removing vertices of U 
and their incident edges from G. For any F c E{G), G — F 
is the graph with vertex set V(G) and edge set E(G) \F. 

For any two nonadjacent vertices a and b of G, an /^^-Z? 
separator of G is a set U of vertices where U C 1^(G) and 
a and Z? are in different connected components oiG — U. 
An a-b separator U is minimal if for every Zi^ C U, W 
is not an a-b separator. A set U c V(G) is a minimal 
separator if is a minimal ^^^-Z? separator for some non- 
adjacent vertices a and b of G. We represent the set of all 
minimal separators of graph G by Ag- Two minimal sepa- 
rators U and are parallel HG — U contains at most one 
component H where V{H) H Zi^ 7^ 0. 

A connected component H of G — U is full if for every 
u e U there exists some vertex v e H where {u, v} e EiG). 

Lemma 1 ([12]). For a graph G and any U C ViG), U is 
a minimal separator ofG if and only ifG — U has at least 
two full components, 

A chord is an edge between two nonadjacent vertices of 
a cycle. A graph H is chordal if and only if every cycle of 
length four or greater in H has a chord. A chordal graph H 
is a triangulation of graph G if V(G) = ViH) and F(G) c 
£(//). The edges in E{H)\E(G) are called fill-in edges of 
G. A triangulation is minimal if removing any fill-in edge 
yields a non-chordal graph. 

A clique tree of a chordal graph // is a pair (T, B) where 
(i) r is a tree, (ii) ^ is a bijective function from vertices of 
T to maximal cliques of //, and (iii) for every vertex v G 
//, the set of all vertices x oiT where v € B{x) induces a 
subtree in T, Property (iii) is called coherence. 

Let be a collection of subsets of y(G). We repre- 
sent by Gjr the graph derived from G by making the 
set of vertices of X a clique for every X e T. The 
next result summarizes basic facts about separators and 
triangulations (see [12-14]). 

Theorem 2. Let T bea maximal set ofpairwise parallel 
minimal separators ofG and H be a minimal triangulation 
ofG. Then, the following statements hold, 

1. Gjr is a minimal triangulation of G. 

2. Let (T, B) be a clique tree of Gjr. There exists a 
minimal separator F e T if and only if there exist two 
adjacent vertices xandyin T where B(x) r\B(y) = F. 

3. Ah is a maximal set of pairwise parallel minimal 
separators ofG and Gah = 

A cut in a connected graph G is a subset F of edges of G 
such that G—F is disconnected. A cut F is minimal if there 
does not exist F^ C F where G — FMs disconnected. Note 



that if F is minimal, G — F has exactly be two connected 
components. Two minimal cuts F and F^ are parallel if 
G — F has at most one connected component H where 
E(H) n r 7^ 0. 

Display graphs and edge label Intersection graphs 

We now introduce the two main notions that we use 
to characterize compatibility and agreement: the display 
graph and the edge label intersection graph. We then 
present some known results about these graphs, along 
with new results on the relationships between them. Here 
and in the rest of the paper, [ m] denotes the set {1, ... , m}, 
where m is a positive integer. Since for any phylogenetic 
tree T there is a bijection between the leaves of T and 
>C(T), we refer to the leaves of T by their labels. 

Let V = {Ti, • • • , T}^} be a profile. We assume that 
for any e [ k] such that / ^ the sets of internal vertices 
of input trees T/ and Tj are disjoint. The display graph 
of V, denoted by G(V), is a graph whose vertex set is 
Uie[k] ^C^i) ^^^^ set is U;^[/^] EiTj) (see Figure 1). A 
vertex v of G(V) is a leaf if v g C(V), Every other vertex 
of G(V) is an internal. An edge of G(V) is internal if its 
endpoints are both internal. 

A triangulation G' of GfP) is legal if it satisfies the 
following conditions. 

1. For every clique C of G' , if C contains an internal 
edge, then it contains no other edge of G(P), 

2, No fill-in edge in G' has a leaf as an endpoint. 

Theorem 3 (Vakati, Fernandez-Baca [8]). A profile V 
of unrooted phylogenetic trees is compatible if and only if 
G(V) has a legal triangulation. 

In what follows, we assume that GCP) is connected. If it 
is not, the connected components of G{V) induce a par- 
tition of V into sub-profiles such that for each sub-profile 
V , G(V^) is a connected component of G(V). It is easy to 
see that V is compatible if and only if each sub-profile is 
compatible. 

The edge label intersection graph ofV, denoted LG(V), 
is the line graph of G(V) [9], That is, the vertex set of 
LG(G) is E(G(V)) and two vertices of LG(P) are adjacent 
if the corresponding edges in G(V) share an endpoint. 
(We should note that Gysel et al. [9] refer to LG(V) as the 
modified edge label intersection graph.) For an unrooted 
tree T, LG(r) denotes LG({r}). 

Observation !♦ Let F be a set of edges of G(P) and 
let {vi, V2, . . . , v^} c V(G(V)) where m > 2, Then, 
vi, V2, . . . , V;^ is a path in G(V) — F if and only if 
{vi, V2}, . . . , {Vm-i> ^m) is a path in in LG(V) — F, 

Thus, if G(V) is connected, so is LG(V). Hence, in what 
follows, we assume that LGCP) is connected. 
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A fill-in edge for LG(V) is valid if for every T G 7^, at 
least one of the endpoints of the edge is not in LG(r). A 
triangulation H of LG{V) is restricted if every fill-in edge 
of H is valid. 

Theorem 4 (Gysel et al [9]). A profile V of unrooted 
phylogenetic trees is compatible if and only ifLG{V) has a 
restricted triangulation, 

A minimal separator F of LG(7^) is legal if for every T g 
V, all the edges of T in F share a common endpoint; i.e., 
F n £(r) is a clique in LG(r). The following theorem was 
mentioned in [9] . For future reference, we formally state it 
and prove it here. 

Theorem 5. A profile V is compatible if and only if 
there exists a maximal set T ofpairwise parallel minimal 
separators in LG{V) where every separator in T is legal. 

Proof Our approach is similar to the one used by 
Gusfield in [15]. Assume that V is compatible. From 
Theorem 4, there exists a restricted triangulation H of 
LG(7^). We can assume that H is minimal (if it is not, 
simply delete fill-in edges repeatedly from H until it is 
minimal). Let T = A//. From Theorem 2, is a maxi- 
mal set of pairwise parallel minimal separators of LG(7^) 
and LG(7^)jr = H. Suppose T contains a separator F that 
is not legal. Let {e, e!\ c F where {e, e!\ c F{T) for some 
input tree T and e ^ e! = 0. The vertices of F form a 
clique in //. Thus, H contains the edge {e, e!\. Since {e, e!\ 
is not a valid edge, H is not a restricted triangulation, a 
contradiction. Hence, every separator in T is legal. 

Let be a maximal set of pairwise parallel minimal sep- 
arators of LG(7^) where every separator in T is legal. From 



Theorem 2, LG(7^)jr is a minimal triangulation of LG(J^). 
If {e, e'} e E(LGiV) ^) is a fill-in edge, then e 0 = 0 and 
there exists a minimal separator F e F where {e, e'} c F, 
Since F is legal, if {e, e^} c E{T) for some input tree T then 
er\e^ 7^ 0. Thus, e and e^ are not both from LG(r) for any 
input tree T. Hence, every fill-in edge in LG{V)jr is valid, 
and LG{V)jr is a restricted triangulation. □ 

Let of be a vertex of some input tree. We write lnc{u) 
to denote the set of all edges of G(P) incident on u, Equiv- 
alently, Inc(w) is the set of all vertices e of LG(7^) such that 
u e e. 

Let F be a cut of the display graph G{V), F is legal if for 
every tree T e V, the edges of T in F are incident on a 
common vertex; i.e., if F H F(r) c Inc(w) for some u G 
V(T),F is nice if F is legal and each connected component 
of G(V) — F has at least one edge. 

Lemma 2. Let F be a subset of E(G{V)). Then, F is a 
legal minimal separator ofLG(V) if and only ifF is a nice 
minimal cut ofG(V). 

To prove the Lemma 2, we need two auxiliary lemmas 
and a corollary. 

Lemma 3. Let F be any minimal separator ofLG(V) and 
u be any vertex of any input tree. Then, Inc{u) g F. 

Proof Suppose F is a minimal a-b separator of LG(P) 
and is a vertex of some input tree such that lnc{u) c F. 
Consider any vertex e e lnc(u). Then, there exists a path 
TT from a to bin LG(V) where e is the only vertex of F in tt. 
If such a path tt did not exist, then F—e would still be an a- 
b separator, and F would not be minimal, a contradiction. 
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Let ei and 62 be the neighbors of e in tt and let e = {u, v}. 
Since lnc(u) ^ F, n does not contain any other vertex e' 
where u G . Thus, e f^ ei = {v} and e n 62 = {W- Let 
TT = . . . , ei, ^2, . . . , ^. Then = a, . . . ,61,62, . . . ,b is 
also a path from a to Z?. But tt^ does not contain any vertex 
of F, contradicting the assumption that F is a separator 
of LG(V). Hence, neither such a minimal separator F nor 
such a vertex u exist. □ 

Lemma 4. TfF /5 minimal s6parator of LG(V), th6n 
LG(V) — F has 6xactly two conn6ct6d compon6nts. 

Proof, Assume that LG(P) — F has more than two con- 
nected components. By Lemma 1, LG(7^) — F has at least 
two full components. Let Hi and H2 be two full compo- 
nents of LG(7^) — F. Let //s be a connected component 
of LG(7^) — F different from Hi and 7/2. By assumption 
LG(7^) is connected. Thus, there exists an edge {6,63} in 
LG(7^) where e g F and ^3 G 7/3. Since //i and H2 are full 
components, there exist edges {e, ei} and {e, ^2} in LG(7^) 
where ei e V(Hi) and 62 G V^(//2)- 

Let e = {u, v}, and assume without loss of generality that 
u G 6 n 63, Then, there is no vertex/ G V(Hi) where 
u G en/. Thus, V G 6 (~\ 61, Similarly, there is no vertex 
/ G V{H2) such that G / H e or v G / H But then H2 
does not contain a vertex adjacent to so H2 is not a full 
component, a contradiction. □ 

Corollary 1. IfF is a minimal s6parator ofLG(V), th6n 
LG(V) — F' is connected for any F' C F. 

Proof of Lemma 2. We prove that if F is a legal minimal 
separator of LGCP) then F is a nice minimal cut of G{V), 
The proof for the other direction is similar and is omitted. 

First, we show that F is a cut of G(V). Assume the 
contrary. Let {w, v} and {p, q] be vertices in different com- 
ponents of LG(P) — F. Since G{V)—F is connected, there 
is a path between vertices u and q in this graph. Also, 
{m, v} ^ F and {p, q) ^ F. Thus, by Observation 1 there is 
also a path between vertices {u, v] and q} of LG(P) — F. 
This implies that {u, v] and {p, q} are in the same con- 
nected component of LGCP) — F^^l contradiction. Thus F 
is a cut. 

Next we show that F is a nice cut of G(V), For every 
F G P all the vertices of LG(r) in F form a clique in 
LG(F). Thus, all the edges of F in F are incident on a com- 
mon vertex, so F is legal. To complete the proof, assume 
that G(V) — F has a connected component with no edge 
and let u be the vertex in one such component. Then, 
lnc(u) c F. But F is a minimal separator of LG(P), and by 
Lemma 3, lnc(u) ^ F, s. contradiction. Thus, F is a nice 
cut. 

Lastly, we show that F is a minimal cut of G{V), Assume, 
on the contrary, that there exists F' <Z F where G{V) — 



F' is disconnected. Since F' (Z F and every connected 
component of G(7^) — F has at least one edge, every con- 
nected component of GfP) — F' also has at least one 
edge. Let {u, v] and [p, q] be the edges in different com- 
ponents of G{V) - F\ By Corollary 1, LG(7^) - F' is 
connected and thus, there is a path between {w, v} and 
{p,q} in LG(V) — F\ By Observation 1 there must also 
be a path between vertices u and p in G(V) — F^ Hence, 
edges {m, v} and {p^q} are in the same connected com- 
ponent of G — F^ a contradiction. Thus, F is a minimal 
cut. □ 

Lemma 5. Two legal minimal separators F and F' of 
LG(V) are parallel if and only if the nice minimal cuts F 
and F' are parallel in G(V), 

Proof Assume that separators F and F^ of LG(V) are 
parallel, but cuts F and F^ of G(V) are not. Then, there 
exists a set {{w, v}, {p, q}} c F^ where {u, v] and {p, q] are 
in different components of G(V) — F. Since F and F^ are 
parallel separators in LGCP), and F does not contain {w, v} 
and {p, q}, there exists a path between vertices [u, v] and 
{p,q} in LG(P) — F. Then, by Observation 1 there also 
exists a path between vertices u and q in G(P) — F. Thus, 
[u, v} and [p, q} are in the same connected component of 
G(P) — F, a contradiction. 

The other direction can be proved similarly, using 
Observation 1. □ 

The next lemma, from [9], follows from the definition of 
restricted triangulation. 

Lemma 6. Let H be a restricted triangulation ofLG{V) 
and let {T,B) be a clique tree of H Let e = {u,v} be 
any vertex in LG(V). Then, there does not exist a node 
X e V{T) where B(x) contains vertices from bothInc(u)\e 
and Inc(v)\e, 

Lemma 7. Let T be a tree in V and suppose F is a min- 
imal cut of G(V) that contains precisely one edge e of T, 
Then, the edges of the two subtrees ofT — e are in different 
connected components ofG(V) — F, 

Proof Let e = {u,v}. For each x e e, let F^ denote the 
subtree containing vertex in F — e. For each vertex x e e, 
all the edges of Tx are in the same connected component 
of G(P) — F as ^, because e is the only edge of T in F. Since 
F is a minimal cut of G(7^), the endpoints of e are in differ- 
ent connected components of G{V) — F. Hence, the edges 
of Tu and Ty are also in different connected components 
of G(P)-F. □ 

Characterizing compatibility via cuts 

A set T of cuts of GCP) is complete if, for every input tree 
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T e V and every internal edge e of T, there is a cut F e T 
where e is the only edge of T in F. 

Lemma 8. G{V) has a complete set ofpairwise parallel 
nice minimal cuts if and only if it has a complete set of 
pairwise parallel legal minimal cuts. 

Proof The "only if part" follows from the definition of a 
nice cut. Let be a complete set of pairwise parallel legal 
minimal cuts. Consider any minimal subset of F that is 
also complete. Let F be a legal minimal cut of J^' . Since J^' 
is minimal, there exists an edge e e F of some input tree 
T such that e is the only edge of T in F. Also, since e is an 
internal edge, both subtrees oiT — e have at least one edge 
each. Thus by Lemma 7, both connected components of 
G(V) — F have at least one edge each. Hence, F is a nice 
minimal cut of G(P), It follows that is a complete set 
of pairwise parallel nice minimal cuts of G(V). □ 

We now characterize the compatibility of a profile in 
terms of minimal cuts in the display graph of the profile. 

Theorem 6. A profile V of unrooted phylogenetic trees 
is compatible if and only if there exists a complete set of 
pairwise parallel legal minimal cuts for G(V), 

Example 1. For the display graph of Figure 1, let 
T = {Fi,F2.F3.F^i where Fi = {{1, 2}, {5, 6}}, F2 = 
{{2, 3}, {6, 7}, {5, 6}}, F3 = {{4,5},{l,2},{l,c}}^fz^F4 = 
{{6, 7}, {2,/}}. Then, T is a complete set ofpairwise parallel 
nice minimal cuts. 

Theorem 6 has an analog in terms of LG(P). Let us say 
that a set T of legal minimal separators of LG(7^) is com- 
plete if for every internal edge e of an input tree T, there 
exists a separator F e T where e is the only vertex of 
LG(r)inF. 

Theorem 7. A profile V of unrooted phylogenetic trees 
is compatible if and only if there exists a complete set of 
pairwise parallel legal minimal separators for LG{P). 

This result is a direct consequence of Theorem 6 and 
Lemmas 2, 5, and 8, so we omit its proof. Instead, we focus 
on the proof of Theorem 6, for which we need the next 
fact. 

Lemma 9* The following two statements are equivalent. 

1. There exists a maximal set T ofpairwise parallel 
minimal separators ofLG(V) where every separator 
in T is legal. 

2. There exists a complete set of pairwise parallel nice 
minimal cuts for G(V). 



Proof (i) ^ (ii): We show that for every internal edge 
e = {u,v} of an input tree T there exists a minimal sepa- 
rator in that contains only vertex e from LG(T). Then it 
follows from Lemmas 2 and 5 that is a complete set of 
pairwise parallel nice minimal cuts for G(V). 

As shown in the proof of Theorem 5, LG(V)jr is a 
restricted minimal triangulation of LG(V). Let (S,B) be 
a clique tree of LG(V)j^. By definition, the vertices in 
each of the sets lnc(u) and Inc(v) form a clique in LGCP). 
Consider any vertex p of S where Inc(w) c B(p) and 
any vertex q of S where Inc(v) c B(q). (Since (S,B) is a 
clique tree of LGCP) jr, such vertices p and q must exist.) 
Also, by Lemma 6, p ^ q, B(p) fl (Inc(v)\{e}) = 0 and 
B(q) n (lnc(u)\{e}) = 0. 

Let 7t — pyXiyX2y . . -yXyny ^ be the path from p to q in 
S where m > 0. Let xq = p and Xm-\-i = q- Let Xi 
be the vertex nearest to p in path n where / e[m -\- I] 
and B(xi) n ilnc(u)\{e}) = 0. Let F = B(Xi-i) n B(xi). 
Then by Theorem 2, F g J-". Since lnc(u) H Inc(v) = {e}, 
by the coherence property, e G B(xj) for every ; e[m]. 
Thus, e e F.By Lemma 6, B(xi-i) n (Inc(v)\{e}) = 0. 
Since B(xi) n (lnc(u)\{e}) = 0, F n Inc(w) = {e} and 
F n Inc(v) = {e}. Thus, for every vertex e^ e LG(T) where 
e ^ e! and e Pi 7^ 0, ^ F. Also, since every separator 
in T is legal, we have/ ^ F for every vertex/ g LG(r) 
where /He = 0. Thus, e is the only vertex of LG(r) 
inF. 

(i) ^ (ii): Consider any complete set of pairwise parallel 
nice minimal cuts T' of G(V). By Lemmas 2 and 5, J^^ is a 
set of pairwise parallel legal minimal separators of LGCP). 
There exists a maximal set T of pairwise parallel minimal 
separators where c J^, 

Assume that F\P contains a minimal separator F that 
is not legal. Then, there must exist a tree T e V where at 
least two nonincident edges ei = {x,y} and ^2 = {»^^y} 
of T are in F. Consider any internal edge e^ in T where 
ei and 62 are in different components of T — ^3. Such an 
edge exists because ei and e2 are nonincident. Since is 
complete, there exists a cut F^ g J^' where es is the only 
edge of r in F^ Since F and F' are in J^, they are paral- 
lel to each other and vertices ei and e2 are in the same 
connected component of LG(7^) — F\ Thus, by Obser- 
vation 1, there exists a path between vertices x and x^ in 
G(V) — F' and edges ei and e2 are also in the same con- 
nected component of G{P) — F' . But by Lemma 7 that is 
impossible. 

Thus, every separator of is legal and is a max- 

imal set of pairwise minimal separators of LGCP) where 
every separator in T is legal. □ 

Proof of Theorem 6. By Theorem 5 and Lemma 9, profile 
V is compatible if and only if there exists a complete set 
of pairwise parallel nice minimal cuts for G(V). The rest 
follows from Lemma 8. □ 
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Splits and cuts 

We first argue that for every nice minimal cut of G(V) we 
can derive a split of C(V), We use the following notation: 
if // is a subgraph of G(V)y then C(H) represents the set 
of all leaves of H 

Lemma 10. Let F be a nice minimal cut ofG(V) and let 
G\ and G2 be the two connected components ofG(P) — F. 
Then, L(Gi) ^ 0fori e {1,2}. In particular, CiGi)\C(G2) 
is a split ofC{V), 

Proof. Consider Gi for each / G {1, 2}. We show that 
C{Gi) is non-empty. Since F is nice, G/ contains at least 
one edge e of G(V). If e is a non-internal edge, then C{Gi) 
is non-empty. Assume that e = {u, v] is an internal edge 
of some input tree T, If F does not contain an edge of T, 
then C(T) <^ OGi) and thus C{Gi) is non-empty. Assume 
that F contains one or more edges of T, Let Tu, Ty be the 
two subtrees ofT — e. Since F is a nice minimal cut, F con- 
tains edges from either Tu or Ty but not both. Without loss 
of generality assume that F does not contain edges from 
Tu^ Then, every edge of is in the same component as 
e. Since contains at least one leaf, C{Gi) is non-empty. 
Thus, C{Gi)\C{G2) is a split oiC{V). □ 

Let a (F) denote the split of C(V) induced by a nice min- 
imal cut F. If is a set of nice minimal cuts of G(V), S (J^) 
denotes the set of all the non-trivial splits in \^p^-pcr(F). 
The following result expresses the relationship between 
complete sets of nice minimal cuts and the compatibility 
of splits. 

Theorem 8. IfG(V) has a complete set ofpairwise par- 
allel nice minimal cuts T, then E(J^) is compatible and 
any compatible tree for E {T^ is also a compatible tree for 
V, 

Example 2. For the complete set ofpairwise parallel nice 
minimal cuts T = {F\,F2, F3, F/^for the display graph of 
Example 1, we have cr(Fi) = abc\defg, cr(F2) = abcfg\de, 
a(Fs) = ab\cdefg, andaiF/^) = abcdefg. Note that these 
splits are pairwise compatible. 

The proof of Theorem 8 uses the following lemma. 

Lemma 11. Let Fi and F2 be two parallel nice minimal 
cuts ofG(V). Then, cr(Fi) andcr(F2) are compatible. 

Proof Let cr(Fi) = Ui\U2 and a(F2) = Vi\V2^ Assume 
that and (j(F2) are incompatible. Thus, Ui n V/ 7^ 0 
for every G {1,2}. Let a e Ui n Vi, b e Ui n V2, 
c e U2 Vi and d e U2 V2. Since {a,b} c Ui, there 
exists a path tti between leaves a and b in G(V) — Fi. 
But a and b are in different components of G(V) — ^2- 



Thus, an edge ei of path tti is in the cut F2. Similarly, 
{c, d} c LI2 and there exists a path 712 between labels c and 
d in G(V) — Fi. Since c and d are in different components 
of G{V) — F2, cut F2 contains an edge e2 of path 712. But 
TTi and 712 are in different components of G{V) — Fi, so 
edges ei and 62 are in different components of G{V) — Fi. 
Since {^i, ^2} ^ i-2, the cuts Fi and F2 are not parallel, a 
contradiction. □ 

Proof of Theorem 8. The compatibility of ^(J-) follows 
from Lemma 11 and Theorem 1. Let <S be a compatible tree 
for S(F), let T be an input tree of V, let = S\/:(T)f and 
let e be any internal edge of T. We show that displays 

cr(e). 

Let cr(e) = A \B. There exists a cut F e T where e is 
the only edge of T in F. By Lemma 7, since F is minimal, 
the leaves of sets A and B are in different components of 
G(V) - F. Thus, if a(F) = A'\B' then, up to renaming of 
sets, we have A ^A^ and B c B' . Because S displays g{F), 
S' also displays G{e). Since S' displays all the splits of T, T 
can be obtained from S' by contracting zero or more edges 
[10]. Thus, S displays T. Since S displays every tree in S 
is a compatible tree for V. □ 

Characterizing agreement via cuts 

The following characterization of agreement is similar to 
the one for tree compatibility given by Theorem 6, except 
for an additional restriction on the minimal cuts. 

Theorem 9. A profile V has an agreement supertree if 
and only ifGiP) has a complete set T ofpairwise parallel 
legal minimal cuts where, for every cut F and for every 
T eV, there is at most one edge of Tin F. 

Example 3. One can verify that the display graph of 
Figure 1 does not meet the conditions of Theorem 9 and, 
thus, the associated profile does not have an AST. On the 
other hand, for the display graph of Figure 2, let T = 
{Tx.T2.F^\ where Fx = {{1, 2}, {4, 5}}, F2 = {{1, 2}, {5, 6}} 
and F3 = {{2, 3}, {6, d}}. For any given input tree T, 
every cut in T has at most one edge of T. Also, T is 
a complete set of pairwise parallel legal minimal cuts. 
Thus, by Theorem 9, the input trees of Figure 2 have an 
AST. 

The analogue of Theorem 9 for LG(7^) stated next 
follows from Theorem 9 and Lemmas 2, 5, and 8. 

Theorem 10. A profile V has an agreement supertree if 
and only ifLG{V) has a complete set T ofpairwise parallel 
legal minimal separators where, for every F ^ T and every 
T eV, there is at most one vertex ofLG{T) in F. 

Theorem 9 follows from Lemma 8 and the next result. 
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(i) 




(iii) 




(ii) 




(iv) 



Figure 2 Agreeing trees, (i) First input tree, (ii) Second input tree, wliicli agrees witli tine first, (iii) Display grapln of tine input trees, (iv) Edge label 
intersection graph of the input trees, where label uv represents edge {u, v} of the display graph. 



Lemma 12. A profile V has an agreement supertree if 
and only ifG{V) has a complete set T ofpairwise parallel 
nice minimal cuts where, for every cut F e and every 
T eV, there is at most one edge of Tin F, 



The rest of the section is devoted to the proof of 
Lemma 12. 

Let S be an AST of V and let e = {u, v} be an edge of 
S. Let Su and be the subtrees ofS — e containing u and 
V, respectively. Let = C(Su) and Ly = C(Sy), Thus, 
cfeiS) = Lu\Ly, Assume that there exists an input tree T 
where C{T) n 7^ 0 for each x G {w, v}. Then there 
exists an edge / e E(T) where, if crf(T) = A1IA2, then 
^1 ^ Lu and A2 ^ Ly, (If there were no such edge, S\jC{T) 
would contain a split that is not in T and would thus not 
be isomorphic to T.) We call e an agreement edge of S cor- 
responding to edge/ of T, Note that there does not exist 
any other edge of T where e is also an agreement edge 
of S with respect to edge of T, 

The cut function of an AST 5' of 7^ is the mapping ^ 
from E(S) to subsets of edges of G(V) defined as fol- 
lows. For every e G E(S), an edge / of an input tree T 
is in ^ (e) if and only if e is an agreement edge of S cor- 
responding to edge / of T, Observe that ^ is uniquely 
defined. Given an edge e g E(S), we define a set Vx for 
each X e e 2iS follows. For every T e V, let Vx,t consist 
of all the vertices of the minimal subtree of T connect- 
ing the labels in jC(T) H Lx^ Then, Vx = Urep ^x,T' 
Note that if e = [u^v] then {Vu^Vv} is a partition of 
V(G(V)), 

Lemma 13. Let S be an AST ofV and let ^ be the cut 
function ofS. Then, for every edge e e E{S), 



(i) ^(e) is a cut ofG(V) and 

(ii) ^ (e) is a minimal cut of G{V) if and only if 

G(V) — ^(e) has exactly two connected components. 

Proof (i) Let e = {u, v}. We show that G(V) - ^(e) does 
not contain an edge whose endpoints are in distinct sets 
of {Vu> Vv}- Assume the contrary. Let / = {x,y} be an edge 
of G(V) - ^(e) where x e Vu andy g Vy, 

Since / g G{V) — ^(e),f ^ ^(e). Suppose / is an edge 
of input tree T There are two cases. 

1. ^(e) does not contain an edge ofT. Then, there 
exists an endpoint p of e where jC(T) ^Lp. Without 
loss of generality, let u = p. Then, V{T) c 1/^ and 
thus y eVu,^ contradiction. 

2. ^{e) contains an edge f ^f of T. Let f = {r, s} and 
let Lr c Lu and Lg c Ly. Let x,r be the vertices of f 
and/^ where Lx C Lr. Since T is a phylogenetic tree, 
such vertices x and r exist. Since Lr ^ L^, both the 
endpoints of f are in V^^, a contradiction. 

Thus, G{V) — ^(e) does not contain an edge whose end- 
points are in different sets of {V^^, Vy}. Since Vu and Vy are 
non-empty, ^(e) is a cut of G(V). 

(ii) The "only if" part follows from the definition of a 
minimal cut. We now prove the "if" part. Let e = {u,v}. 
Assume that G(P) — ^{e) has exactly two connected com- 
ponents. From the proof of (/), Vu and Vy are the vertex 
sets of those two connected components. Consider any 
edge/ G ^ie). The endpoints of/ are in different sets of 
{K^, Vy} and thus are in different connected components 
of G(7^) - VI/ (e). Hence, G{V) - {^{e)\\f}) is connected. 
Thus, if G(P) — ^{e) has exactly two connected compo- 
nents, ^{e) is a minimal cut of G(P). □ 
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The next observation summarizes two basic facts about 
cut functions. 

Observation 2. Let S be an AST of V. Then, the cut 
function ^ ofS has the following properties, 

1. For any two distinct edges ei and e2 in E(S), 
^(ei) 7^ vl/(e2). 

2. Let e = {w, v} be an edge of S. For any input tree T 
where C{T) ^Lyi^^, all the labels of C{T) n Ly are 
in the same connected component of G(V) — ^(e). 

Let S be an AST of V and let e be an edge of S, Although 
Lemma 13 shows that ^(e) is a cut of G(V), ^(e) may not 
be minimal. We now argue that we can always construct 
an agreement supertree whose cut function gives minimal 
cuts. 

Lemma 14, IfV has an AST, then it has an ASTSofV 
whose cut function ^ satisfies the following: For every edge 
e e S, ^(e) is a minimal cut ofG(V), 

We prove Lemma 14 by arguing that any AST 
that fails to satisfy the required cut minimality prop- 
erty can be transformed into one that does, through 
repeated application of the "splitting" operation, defined 
next. 

Suppose e = (u, v) is a an edge of S where ^(e) is not 
minimal. Let {Li, . . . , L^} be the partition ofLy where for 
every i e[m], Li = C{C) H Ly for some connected compo- 
nent C in G{V) — ^(e). We assume without loss of gen- 
erality that m > 1 (if not, we can just exchange the roles 
of u and v). Let Ry be the rooted tree derived from Sy by 
distinguishing vertex v as the root. Let Ry^i be the (rooted) 
tree obtained from the minimal subtree of Ry connecting 
the labels in Li by distinguishing the vertex closest to v 
as the root and suppressing every other vertex that has 
degree two. To split edge e at u is to construct a new tree 
y from S in two steps: (i) delete the vertices of Ry from S 
and (ii) for every / G [ m], add an edge from u to the root of 

Observation 3, Let S be an AST ofV and let ^ be the 

cut function ofS. Let S^ be the tree derived by splitting edge 
e = {u,v} at u. Consider any connected component C of 
G(V) - ^(e) where C(C) n Ly ^ 0. Then, for every X c 
(£(C) n Ly\ S\x and are isomorphic. 

The next observation follows from the definition of AST. 

Observation 4. Let S and T be two phylogenetic trees 
where C{T) c C{S) and T agrees with S. Then, T 
and S\u agree for every U such that C{T) c U c 
C{S). 



Lemma 15. Let S be an AST ofV and let e = {u, v} be 
an edge ofS. Let S' be the tree derived by splitting edge e at 
u. Then, S' is an AST of V. 

Proof By construction, S' is a phylogenetic tree over 
C(V). As before, let {Li, . . .yLm) be the partition of Ly 
where for every / e[m], Li = C(C) H Ly for some con- 
nected component C in G(V) — ^(e). Consider any input 
tree T of profile V, We prove that T and S' agree. There 
are three cases. 

Case 1: C(T) c Lu. Since C(T) c L^, by Observation 4, 
T and 5|i^ agree. By the definition of the split operation, 
trees S\l^ and S^^j^^ are isomorphic. Thus, T and S' agree. 
Case2:C(T) c Ly. By Observation 2(ii), C(T) c Li for 
some / e[m]. Since T and S agree and C(T) c Li, by 
Observation 4, T and S\Li agree. By construction, trees S\Li 
and 5'!^. are isomorphic. Thus, T and S^ agree. 
Case 3:\C{T) HL^^Q) and (C(T) HLy^ 0). By 
Observation 2(ii), C(T) Pi c Li for some / g[ m]. Since 
T and S agree and C(T) <^ {Lu U L/), by Observation 4, T 
also agrees with ^k^^ul,). By construction, trees 5'|(i^ui/) 
and are isomorphic. Thus, T and agree. 

It follows that T and S' agree. 

Thus,y isan ASTofP. □ 

Observe that if is the tree obtained by splitting edge 
e = {m, v} of 5 at u, then the edges of E(Su) are in both S 
and S\ 

Therefore, £(S)\£(S^) = E(S)\E(S') and£(S0\^(5^) = 
E(S')\E(S), 

Lemma 16. Let S be an AST ofV and let e = {u, v} be 
an edge of S, Let S' be the tree obtained by splitting e at 
u. Let ^, ^' be the cut functions of S and S' respectively. 
Consider any edge f g E{S') \ E{S), There exists an edge 
e' e E(S) \ E(S') where ^'{f) c \\f{e'). Furthermore, if 
^(eO is a minimal cut of G(V) then ^'(f) = and 
^'(f) is a minimal cut ofG(V), 

Proof Let / = {x, y] and let x be the vertex of / where 
Lx ^ Ly. Let Sp be the minimal subtree of S connecting 
the labels in Lx. Let p be the vertex of Sp closest to u in S. 
Let q be the vertex adjacent to p in the path from p to u. 
Let e^ = {p,q}. Note that, Lx ^ Lp. Since Lx ^ Ly, e! is an 
edge oiE(S) \ E(S'). Consider any tree T that has an edge 
fx in "^'{f). We show that C{T) n = C{T) n Lp. It then 
follows that /i G ^{e!) and thus, ^' if) ^ ^(^0- 

Since Lx c Lp, (Lx H C(T)) c (C(T) n Lp). By Obser- 
vation 2(ii), all the labels in C(T) H Ly are in the same 
connected component of G(V) — ^(e). Thus, all the labels 
in Lx ^ (Lp n C(T)) are in the same connected of G(V) — 
vl/(e).lf(l^n/:(r)) 2 (LxnC(T)),thenS\^LMLpnc(T)) and 
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^\(LxU(L nc{T)) isomorphic, contradicting Observa- 

tion 3. Thus, {Lp n C{T)) c {L^ n C{T)). 

Assume that ^(eO is a minimal cut of G{V). Then, all 
the labels in Lp are in the same connected component of 
G{V) - ^(eO. By Observation 3, Lp = L^, Thus, ^'{f) is 
also a minimal cut of G(V). □ 

Lemma 17 » Let S be an AST of V and ^ be the cut 
function of S, Let Eq be the set of all edges e of S such 
that ^(e) is not a minimal cut ofG(V). Choose any edge 
= {m, v} G Eq such that \^{e'')\ = maxge^g \^(e)l Let 
y be the tree obtained from S by splitting e* at u and let 
be the cut function ofS\ We have the following, 

1. For any edge f e E{S'), if\^'(f)\ > then 
"^'(f) is a minimal cut ofG(V). 

2. Let P be the set of all edges x in S such that 
|vl/(e*)| = \^{x)\ and^(x) is not a minimal cut Let 

be the set of all edges x in such that 
|vi/(e*)| = \^f{x)\ and^f\x) is not a minimal cut 
Then, \P'\ < \Pl 

Proof (i) Consider any edge/ e E(S^) where |^'(/')| > 
If/ e E(S) n£(y), then ^^(f) = ^^'(f). Since 
l^(/^)l > by assumption ^(f) is a minimal cut of 

G(V). Thus, ^'(f) is also a minimal cut of G(V). Assume 
that / e E(S^) \ E(S), By Lemma 16, there exists an edge 
e' G E(S) where ^^(/') c vl/(e0. Since \^'(f)\ > 

> Thus, by assumption ^(eO is a mini- 

mal cut of G(V), From Lemma 16, it follows that ^(eO = 
^f'(f) and ^'(f) is a minimal cut of G(V). 

(ii) Let Q = Pn(E(S)\E(S')) and = P'n(E(S')\E(S)). 
It suffices to show that |Q^| < |Q|. Consider any edge 
/ G Q'. By Lemma 16, there exists an edge e' e E(S) \E(S') 
where ^'(f) c vi/(eO. Thus, 1^(^01 > \^'(f)\Af\^(e')\ > 
\^^(f)\, then by assumption ^(e^) is a minimal cut and 
thus by Lemma 16 |^(eOl = a contradiction. 

Thus, = ^^(f). Also, since ^f'(f) is not a mini- 

mal cut, by Lemma 16, neither is ^(eO- If e' = e*, then 
all vertices of Vy are in the same connected component 
of G(V) — ^(e), contradicting the assumption that it is 
possible to split at u. Thus, e' 7^ e*. Hence, we can 
conclude that for every edge/ G there exists an edge 
e' e{Q\ {e*}), where ^'(f) = ^f(e'). 

Let/i and/2 be any two distinct edges in Q\ Let ei and 
e2 be the edges of Q \ {e*} where ^^(/i) = ^(ei) and 
m^(f2) = VI/ (^2). If ei = 62, then ^f'(fi) = vl/^(f2), con- 
tradicting Observation 2(i). Thus, ei 7^ ^2. Since e e Q 
and e ^ Q', it follows that |Q'| < |Q| - 1, and thus 

\Q'\ < \Ql □ 

Proof of Lemma 14, Let S be an AST of V and ^ be 
the cut function of S, Do the following while S contains 
an edge e such that ^(f) is not a minimal cut of G(V): 



Pick an edge e* satisfying the conditions of Lemma 17, and 
apply a split operation at e*; let be the resulting tree. 
By Lemma 15, is also an AST of V, Let ^1/^ be the cut 
function ofS\ Set S to 5' and ^ to ^1/^ 

We only need to prove that the total number of iter- 
ations, 5, is finite. An AST of V has at most 2\C{V)\ 
vertices. Also, | ^ (e) | > 1 for any edge e of S, It thus follows 
from Lemma 17 that 5 is finite. □ 

Proof of Lemma 12, Assume that V has an AST. 
Then, by Lemma 14, V has an AST S whose cut function 
^ has the property that, for every edge e g E{S), vl/(e) is a 
minimal cut of G(V), Let be the set of all ^(e) such that 
e is an internal edge of S, Then, is a set of minimal cuts 
of G(V), Further, by definition of ^1/, for every F e T and 
for every T e V, F contains at most one edge of T Thus 
every cut in T is legal. We now prove that is a complete 
set of pair wise parallel nice minimal cuts of G(V), 

We first argue that every cut in is nice. Consider any 
F e J^, Let e = {w, v} be the internal edge of S where 
vl/(e) = F, Let T be an input tree that has an internal edge 
/ in^(e). Since e is an internal edge at least one such input 
tree exists; otherwise ^(e) is not a minimal cut. Now, by 
definition,/ is the only edge of T in ^(e), so, by Lemma 7, 
each of the two connected components of G(V) — ^(e) 
has at least one non-internal edge of T Hence, F is a nice 
minimal cut of G{V), 

To prove that the cuts in T are pairwise parallel, we 
argue that for any two distinct internal edges ei and ^2 of 
5, ^(ei) and ^(^2) are parallel. There exist vertices x e ei 
andy G e2 where Lx ^ Ly, For every edge / G ^(ei), we 
show that / G VI/ (^2) or/ c Vy, It then follows that ^(ei) 
and ^(^2) are parallel. Let / be an edge of input tree T, 
Then there exists z ef where L^ ^ Lx, Thus, L^ ^ Ly and 
z G Vy. By Lemma 13, all the vertices of Vy are in the same 
connected component of G(V) — ^fe). Thus,/ G ^fe) 
or/ c Vy, 

Lastly, we show that T is complete. Consider any inter- 
nal edge/ = {p, q) of some input tree T, Since S is an AST 
of V, there exists an edge e = {u,v} where, up to relabeling 
of sets, Lp c Lu and Lq c Ly, Thus, e is an agreement edge 
of S corresponding to/ so/ G vl/(e). Since / is an internal 
edge, e is also an internal edge of S and thus ^{e) g T, 
Hence, for every internal edge/ of an input tree there is a 
cut F ^ T where/ g F, Thus, § is complete. 

(^) Assume that there exists a complete set T of pair- 
wise parallel nice minimal cuts of G(7^) where, for every 
F e F and every T e V, F contains at most one edge of 
T, By Theorem 8, S (T) is compatible and, by Theorem 1, 
there exists an unrooted tree S where T^(J-) = ^(S), We 
prove that S is an AST of V by showing that Ti(S\c(T)) = 
E(r) for every input tree T eV, 

Consider an input tree T of V, Let Xi \X2 be the non- 
trivial split of T corresponding to edge/ G E(T), Since 
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is complete, there exists a cut F e where/ g J^, 
If (j(F) = Yi\Y2, by Lemma 7, up to relabeling of sets, 
Xi C Yi for every / e {1, 2}. Since cr(F) is a split of 5, this 
implies that ECr) c E(5|/:(r)). 

Consider any non-trivial split Pi \P2 of E (S) where Pi H 
£(7) # 0 for each / e {1,2}. Let Q,- = Pi n for 
each / e {1,2}. Since E(5') = E(J^), there exists a cut 
F e T where cr(F) = Pi\P2' Since Pi and P2 are in dif- 
ferent connected components of G(V) — F, Qi and Q2 
are also in different connected components of G(V) — F, 
Thus, F contains an edge/^ of T, Since F does not contain 
any other edge of T, a if') = Q1IQ2. Thus, E(5|£(r)) c 

E(r). □ 

Relationship to legal triangulations 

Taken together. Theorems 3 and 6 say that G(V) has a 
complete set of pairwise parallel legal minimal cuts if and 
only if it has a legal triangulation. The connection between 
legal triangulations and complete sets of pairwise parallel 
legal minimal cuts is through the existence (or nonexis- 
tence) of a compatible tree. Here we make the connection 
explicit, showing how, from a set of pairwise parallel legal 
minimal cuts, one can construct a legal triangulation of 
G(V) without going through a compatible tree. We leave 
the other direction — going from a triangulation to a set of 
cuts — to the reader. 

Let be a complete set of pairwise parallel legal 
minimal cuts of G(V), We assume that the elements 
of are ordered in some arbitrary, but fixed, man- 
ner, and that no proper subset of is also complete. 
For each F g J-", we build a pair (Xp, Yp) where Xp 
and Yp are vertex separators of G(V), and Xp, Yp c 
{u : uis the endpoint of some edge inF}. The collection of 
pairs {(Xp, Yp) : F e J^} is not unique, as it depends on 
the order in which is arranged. We say that a cut F e 
differentiates an internal edge e — {x^y} ii x e Xp and 
ye Yp, 

For each F e let = Fiji) H F for each / g[ /c], and 
let F c F denote the set of all edges e such that e g F/ for 
some i e [ with \Fi | = 1. Note that if \Fi \ > 1, all edges in 
Fi must share a common endpoint. Let Ap and Bp denote 
the two connected components of G(V) — F. 

For each cut F in we build (Xp, Yp) as follows. 

1. For each internal edge e e F: 

(a) If no cut preceding F differentiates e, add 
e n V(Ap) to Xp and e n V(Bp) to Yp. 

(b) Otherwise, suppose cut I e which 
precedes F, differentiates e. Let Q be the 
connected component of GiP) — I where 
E(Q) n F 7^ 0. (Note that Q is unique, since 1 
and F are parallel) Let v be the unique 
endpoint of e in Q. Add v to Xp and Yp. 



2. For each non-internal edge e g F, add the non-leaf 
endpoint of e to both Xp and Yp. 

3. For each / e.[k\ such that \Fi \ > 1, add the common 
endpoint of the edges of Fi to both Xp and Yp. 

By construction and the properties of every edge 
internal edge of G(V) is differentiated by some cut F G 
Further, the sets Xp and Yp have the form Xp = 
{xi, . . .,Xm>zi, . . .,Zp} 2ind Yp = {yi, . . .,ym>zi, . . .,Zp}, 
where m > 0, p > 0, and for every / e[m], {xi,yi} is an 
internal edge of G(V) that is differentiated by F. Let 

Op = [{xi, . . . ,Xj,yj, . . .,ymyZi, . . . ,Zp} :J e[m] }. 

We now state how to go from a complete set of pair- 
wise parallel legal cuts to a legal triangulation. As in 
Section 'Preliminaries', given a graph G and a collection A 
of subsets of V(G), Ga denotes the graph derived from G 
by making the set of vertices of X a clique for every X g A. 

Theorem !!♦ Let A be the collection of subsets of 
y(G(V)) given by 

A = {Ng{v) (^) '^isa leaf in G (P)}U |J ({Xp, Yp}\JOp). 

FeJ' 

(1) 

Then, G(V)a is a legal triangulation ofG(V), 

The proof of Theorem 11 relies on a series of auxiliary 
lemmas, for which we introduce some new notation. For 
each F e F\j denotes Xp U Yp and Fn denotes Xp fl 
Yp. Also, we abbreviate G(V)a to Ga> where A is the set 
defined in Equation (1). 

Lemma 18. Let F and I be two distinct cuts ofT, and let 
xbe a vertex ofF^j. Suppose x lies in the connected compo- 
nent of G(V) — I that does not contain edges ofF, Then, 
X G /n. 

Proof Let Ep^x be the set of all edges of F that contain x 
and let E^x be the set of all edges of / that contain x. We 
must have Ep^x ^ F^x ^ F If \Fix\ > h then;^ g In- Thus, 
assume that \Eix\ = 1- Ei,x = {^}> where e = {x,y}. 
Since Ep^x ^ Fj^x and \Ep^x\ > h Fp^x = {^}- We can 
assume that y is not a leaf (since, otherwise, x G /n). Let 
Ej^y be the set of edges of / with y as an endpoint. Vertex 
y lies in the component of G — F that does not contain L 
Thus, every edge in Ej^y is also present in F. If \Fj^y\ > 1, 
then there is more than one edge in F with y as an end- 
point and by construction, x ^ Fyj. Hence, |F/,y| = 1, and 
so Ei^y = {e}. 

Let / be the cut that differentiates e. If F = J then by 
construction, x e In^ Thus, assume that F 7^ /. If / is in 
the same connected component of G(V) — F as /, then, by 
construction x ^ F\j, which is a contradiction. Thus, / is 
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in the connected component of G(V) — F that does not 
contain I and, by construction, x e In- □ 

Lemma 19. Let F e For every edge {u, v} in Ga. (i) 
ifue V(Af) \ Fn, then v ^ V{Bf) \ Yp, and (ii) if u e 
V{Bf) \ Fn, then v ^ V{Af) \ Xp, 

Proof, Without loss of generaUty, we consider only the 
case where u e V{Ap) \ Fn- Suppose that v e V{Bf) \ Yp. 
If e e E(G(V)), then e e F and hence, by construction, 
at least one of u and v is in Fy- But v ^ Yf, so w g Fn> a 
contradiction. 

Thus, e must be a fill-in edge. Since e ^ Fu, there must 
be a cut 7 G J", / 7^ F, such that e c 7^. UE(Af) H 7 7^ 0, 
then by Lemma 18, v g Fn> a contradiction. Thus, assume 
that E(Bf) ni ^ 0, Then, by Lemma 18, u e Fn> another 
contradiction. □ 

A clique of Ga is illegal if it contains a fill-in edge 
with a leaf as an endpoint or it contains an internal edge 
along with any another edge of G{V). An illegal clique 
violates one of the legal triangulation conditions (LTl) 
or (LT2) stated in Section 'Display graphs and edge label 
intersection graphs! 

Lemma 20. Let Fbea cut ofT and let H be the subgraph 
ofGA induced by vertices of Fyj, Then, H is triangulated 
and contains no illegal clique. 

Proof, Let Xp — {^1, . . . , Xm^ 2:1, ... , Zp) and Yp — 
{yi> • • • >ym>Zi, . . . ,Zp}, where for every / G[m], xi G 
V(Af), yi G V{Bf) and {xuyi} is an internal edge of G{V), 
Note that Fn = [zi, • • • , Zp}, 

Claim. For every i,j e[m] where i > j, 
e = {xuyj} ^E{H), 

Proof, Assume that e g F(77). By construction of 
{Xp, Yp), e is a fill-in edge. Since no set in Op contains 
both Xi and jy, there is a cut I ^ T where e c 7u. Since 
F and 7 are parallel, only one of the two sets 7 Pi F{Ap^ 
or 7 n F(Bp) is non-empty. Assume that 7 0 F{Ap) 7^ 0. 
Then by Lemma 18, yj G Fn, a contradiction. Similarly, 
if 7 n E{Bp) ^ 0, then by Lemma 18, Xi G Fn, a 
contradiction. □ 

Let C be a chordless cycle of length at least four in 77. 
Since Xp and Yp are cliques in Ga, if C contains more than 
two vertices from one of Xp or Ip, then C must contain a 
chord. Hence, C has exactly four vertices, with exactly two 
vertices each from Xp and Yp, We will first show that Zi ^ 
C for any / G p. Assume that Zi G C for some i e[p]. Then, 
of the remaining three vertices of C, at least two of them 
belong to one oiXp and Yp. Let a^ b be those two vertices. 



Without loss of generality assume that {a, b} c Xp, Since, 
^ Xp, vertices Zi, a, b form a clique in 77. Thus, C is not 
chordless, a contradiction. 

Let Xi, Xj be the vertices of Xp in C where I < i < j < 
m. Similarly, let j/^y/ be the vertices of Yp in C where 
1 </'</< m. Now, either / < i^ or / > i\ If / < i\ 
then {xi, . . . , Xi,yi, . . .,ym> Zi, . . Zp] e Op and thus ver- 
tices xuyt'.yf form a clique. Hence, C is not chordless, a 
contradiction. If / > i' , then from the above claim neither 
of the edges {xi.yi'} and {xj.yi'} can exist. Thus, vertex;/// 
cannot be in C, a contradiction. Hence, 77 does not contain 
a chordless cycle and is triangulated. 

Assume that 77 contains an illegal clique 77^; that is, 77^ 
contains two internal edges e and e' , By construction, Fy 
cannot contain a leaf. By the legality of F and the construc- 
tion of Fu, edges e and e^ are from different input trees 
and both are differentiated by F. Let e = {xi,yi} for some 
/ e[m] and let e^ = {xj,yj} for some / e[m]. Without loss 
of generality, assume that / < By the above claim, there 
is no edge between xj and yi in 77; thus, 77^ is not a clique, 
a contradiction. □ 

Lemma 21. Ga is chordal. 

Proof, Assume the contrary. Let C be a chordless cycle 
of length at least four in Ga- By construction, C cannot 
contain a leaf. There are two cases. 

Case 1: There are vertices u,v e V(C) and a cut F e T 
where u e Xp\Fn and v g \ Fn. 
We have two subcases. 

(a) Suppose C contains a vertex x G Fn. Then, there 
exists a path u, x, v in C Because C is a cycle, there 
must exist an edge between a vertex u' e V(Ap) \ x 
and v^ G V(Bp) \ x. Since C is chordless, u^ ^ Fn and 
V i Fn. Thus, u' g V(Ap) \ Fn and v' g V(Bp) \ Fn. 
By Lemma 19, if u^ G ViAp) \ Xp then there is no 
edge between u^ and v\ Thus, u^ e Xp\Fn. 
Similarly, G Yp \ Fn. If u u^ or v v' , C cannot 
be chordless. Thus, u = u! and v = v' and C has 
length three, a contradiction. 

(b) Suppose C does not contain a vertex ofFn. Since 

u G V(Ap) \ Fn, V G V(Bp) \ Fn and F is a cut, there 
must exist two edges ei = {xi.yi} and e2 = {X2>y2} 
in C where {;vi,»%2} ^ ViAp) \ Fn and 
{yi.yi} c V(Bp) \ Fn. If XI G V(Ap) \ Xp, then by 
Lemma 19 there cannot exist an edge between xi 
andyi. Thus, xi e Xp\Fn. Similarly, X2 ^ Xp\ Fn 
and {^1,^2} ^ Yp\Fn. Since Xf and Yp are cliques 
in Ga, there exist edges {xi,X2} and {3/1,3/2}- Thus, 
there cannot exist any other vertex in C and hence 
V(C) c F\j. But, by Lemma 20 subgraph of Ga 
induced by vertices of Fy is triangulated. Thus, C is 
not chordless, a contradiction. 
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Case 2: There is no cut F ^ T with vertices u e Xp \Fn 
and V G \ Fn ^uch that u,v e V(C), Thus, for every 
cut F G J-" at most two vertices of V(C) are in Fy- Let 
xi,X2>X3,X4 be a path of length four in C For every / G 
{1, 2, 3}, let F^^ eThe the cut where c F^\ We 

will first show that such cuts exist and are distinct. 

Recall that every vertex in C is internal. Also, C does 
not contain any edge e = {x, y} from G; otherwise, there 
would be a cut F' that differentiates e, contradicting the 
assumption for case 2. Since every edge in C is in Ga, it 
must be the case that for every edge e in C there exists a 
cut F where e c Fy . Also, at most two vertices of C are in 
Fu. Thus the cuts F^^), F^^) ^(3) 

are distinct. 

To simplify notation, for each / G {1, 2, 3} let A/ = Apu) 
and Bi = Bpio . Without loss of generality, assume that 
E(Ai) n F(2) ^ 0 and £(82) H F^^) 7^ 0. There are three 
possibilities. 

(a) Suppose F^^^ n £(^2) 7^ 0. If ^1 G A2, then by 
Lemma lS,xi ^ ^ chordless, a 
contradiction. Thus, xi e B2. Similarly, if g by 
Lemma 18, ^4 G F^ ^ and C is not chordless, a 
contradiction. Thus, X4 e A2. Since C is a cycle, F^-^^ 
is a minimal cut and {Fy \ {:V2>^3}} ^ ^(O = 0, 
there exists an edge {vi, V2} in C where 

vi G V(A2) \ F^f^ and V2 G 1/(^2) \ i'u ^ But, by 
Lemma 19, such an edge cannot exist. 

(b) Suppose F^^^ nF(Ai) ^ 0 andF^^> nF(52) 7^ 0. 
Without loss of generality, assume that A3, B3 contain 
F^^^ and F^^^ respectively. Assume that X2 G A3. 
Since c\:2 ^ ^u^\ by Lemma 18, X2 G Fp^\ Then, there 
exists an edge {;v:2,^4} and C is not chordless, a 
contradiction. Thus, X2 G ^3. But X2 ^ Fy and thus, 
by Lemma 18, cv2 ^ F^ \ Hence, there exists a chord 
{jV2,:V4} and C is not chordless, again a contradiction. 

(c) Suppose F^^^ n F(^i) 7^ 0. Renaming vertices ;V2> 
X3 and CV4 as, CV4, xs, X2 and cvi, respectively, brings us 
back to subcase 2(b). 

Thus, Ga does not contain a chordless cycle of length 
four or greater; hence, Ga is chordal. □ 

Proof of Theorem 11, Lemma 21 states that Ga is tri- 
angulated. We now prove that Ga is a legal triangulation; 
i.e., that it satisfies conditions (LTl) and (LT2) of Section 
'Display graphs and edge label intersection graphs'. 

Condition (LT2) holds for Ga, because our construction 
adds no fill-in edge incident on a leaf. Now suppose that 
Ga violates (LTl); i.e., Ga has a clique // with two internal 
edges e = ji} and e' = {»%2> 72}- Let F be the cut that 
differentiates Assume that G V{Ap) andyi g V{Bf). 
By Lemma 20, Fy does not contain both endpoints of e' . 
Without loss of generality, assume that X2 ^ Fy and X2 G 



A. Since X2 ^ Fy and yi ^ Fn> by Lemma 19, there is no 
edge between X2 and yi in Ga- Thus, H is not a clique 
of Ga, a contradiction. Hence, Ga satisfies (LTl) and is 
therefore a legal triangulation of G(V), □ 

Conclusion 

We have shown that the characterization of tree compati- 
bility in terms of restricted triangulations of the edge label 
intersection graph transforms into a characterization in 
terms of minimal cuts in the display graph. These two 
characterizations are closely related to the legal triangu- 
lation characterization of [8]. We also derived character- 
izations of the agreement supertree problem in terms of 
minimal cuts and minimal separators of the display and 
edge label intersection graphs respectively. 

It remains to be seen whether any of our characteriza- 
tions can lead to explicit fixed-parameter algorithms for 
the tree compatibility and agreement supertree problems 
when parametrized by the number of trees. Indeed, as of 
yet, the fixed-parameter tractability of agreement remains 
open. 

We close with some remarks on characterizations of two 
problems related to compatibility. A profile V defines a 
tree 5 if 5 is the only compatible supertree for V, V iden- 
tifies a tree 5 if 5' is a compatible supertree for V and every 
other compatible supertree for V displays S. Grunewald 
et al. [16] use quartet graphs to characterize when a profile 
consisting of quartet trees defines or identifies a tree. An 
interesting question is whether similar characterizations 
can be derived for arbitrary profiles using display graphs 
or edge label intersection graphs. Along these lines, we 
note a connection between complete sets of cuts and the 
question of whether a profile defines a tree, which was 
pointed out by one of the reviewers. To explain it, we need 
some definitions ([10], p. 131). Let F be a tree and let 
q = xy\wz\ie2i quartet tree displayed by F. Quartet tree q 
distinguishes an interior edge e of F if e is the only interior 
edge such that {x,y\ and {w,z\ are in different connected 
components oiT — e. Now, let S and T be two trees such 
that S displays F. An interior edge e of F distinguishes an 
interior edge/ of S if there exists a quartet q such that e 
and/ are both distinguished by q. Suppose P is a profile 
in which there is at least one taxon in common among all 
input trees. Then, V defines a tree S if and only if V is com- 
patible and every interior edge of S is distinguished by an 
interior edge of at least one tree in V ([10], p. 133). Now, 
recall that if is a complete set of cuts of G(V), then, for 
every tree F/ G V and every internal edge e of F/, there is 
some cut F e T in which e is the only edge of F/. Thus, if 
V is compatible, e must be a distinguishing edge for some 
internal edge of a supertree for V. This observation could 
lead to a cut-based characterization of definability analo- 
gous to known triangulation-based characterizations (see 
[10], p. 79). 
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